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Optimal  Sequential  Selection  of  a  Monotone 
Sequence  from  a  Random  Sample 


By 

Stephen  M.  Samuels  and  J.  Michael  Steele 

1.  Introduction. 

A  central  theme  in  the  theory  of  optimal  stopping  is  that  many 

stochastic  tasks  can  be  performed  almost  as  well  by  someone  unable  to 

foresee  the  future  as  by  a  prophet.  In  one  classic  example,  the 

"secretary  problem",  the  task  is  to  stop  at  the  largest  of  n  sequentially 

observed  independent  identically  distributed  observations  X^,Xg, . . .  ,Xn« 

Without  clairvoyance  one  attains  where  t  is  some  stopping  time, 

but  a  prophet  is  always  able  to  attain  max  X. . 

l<i  <n  1 

That  the  prophet's  advantage  is  rather  modest  follows  from  the  well 
known  fact  that  no  matter  how  large  n  is  there  is  a  stopping  time 
for  which 


(1.1)  P(X  =  max  X.  )  >  e-1  , 

n  1  <  i  <  n  1 

(see,  e.g.  Gilbert  and  Mosteller  (1966)). 

The  stochastic  task  we  consider  here  is  more  complex  than  the 
secretary  problem  and  the  central  theme  is  illustrated  in  a  different 
way  from  (l.l).  To  set  the  problem,  let  (X,:  1  <  i  <  <»},  denote 
independent  random  variables  with  continuous  distribution  F.  The 
basic  object  of  interest  is 
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L  =max(k:  X.  >  X.  >***>X.  with  l<i1  <i5<  •  •  •  <i,<  n)  , 
n  h  *2  k  -  1  2  li¬ 

the  length  of  the  longest  monotone  decreasing  subsequence  of  the  sample 
{X^Xg, . . .  ,Xn) .  (We  could  equally  well  have  considered  increasing  sub¬ 
sequences  but  the  notation  will  be  simpler  this  way. )  The  variable  Ln 
has  been  studied  extensively  and  it  is  now  known  that 

(1.2)  ELn  ~  2/i i  . 

Ihe  first  result.  EL  ~  c/n,  was  obtained  by  Hammersley  (1972)  via 
an  ingenious  use  of  the  planar  Poisson  process.  Baer  and  Brock  (1968)  had 
conjectured  earlier  on  the  basis  of  computer  simulations  that  c  =  2.  By 
a  delicate  variational  argument  Logan  and  Shepp  (1977  )  proved  that  c  is 
at  least  2  and  by  a  similar  method  Vernik  and  Kerov  (1577 )  established 
that  c  equals  2. 

How  well  can  one  sequentially  choose  a  monotone  decreasing  subsequence 

using  only  stopping  times?  Formally,  we  call  a  sequence  of  stopping  times 

a  policy  if  (1)  they  are  adapted  to  (X^:  1  <  i  <  00) ,  (2 )  1  <  <  •  •  • 

and  (3 )  X  >  X  >  •  •  •  .  Hie  class  of  all  policies  is  denoted  by  si 
T1  t2 

and  our  main  problem  is  to  determine 

u  =  sup  E(max(k:  t.  <  n) )  . 

n  0  k  ™ 

T€* 


Die  quantity  u  is  the  largest  expected  length  of  a  monotone  decreasing 
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subsequence  which  can  be  achieved  by  sequential  selection.  Although  ur 
would  a  priori  depend  on  F,  one  can  easily  check  that  it  is  the  same 
for  all  continuous  F.  Moreover  the  optimal  policy  for  a  given  F  can 
be  obtained  via  the  probability  integral  t  reins  form  from  the  policy  for 
the  uniform  distribution  on  [0,1]. 

Our  main  result  is  the  following 

THEOREM. 

(1.3)  un  ~  . 

The  intuitive  content  of  this  result  is  another  illustration  of  the 

central  theme;  the  prophet  asymptotically  outperforms  the  intelligent 

(but  non -clairvoyant )  individual  by  only  a  factor  of  -/2  ,  One 

should  also  note  that  the  naive  individual  who  too  eagerly 

reports  each  successive  record  low  achieves  an  expected  length  of  only 
n 

£  l/k  ~  log  n  and  thus  does  much  worse  than  the  prophet  or  intelligent 
k=l 

individual. 

Our  proof  that  un  ~  </2ri  begins  with  a  simple  algorithm  for  computing 
un  based  on  an  integral  equation  obtained  by  dynamic  programming.  Standing 
alone,  the  integral  equation  seems  to  be  ineffective,  so  in  Section  3  we 
prove  by  a  subadditivity  argument  that  un//n  has  a  limit.  A  sequence  of 
efficient,  but  suboptimal,  policies  ire  then  given  in  Section  4  which  show 
lim  un//n  >  ^2.  Hie  crux  of  the  proof  is  in  Section  5,  where  the  integral 
equation  is  finally  used  to  show  lim  y\  /Jn  <^2  after  first  establishing 
an  essential  regularity  property  of  the  solution  by  a  probabilistic  argument. 
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The  proof  outlined  above  yields  several  results  en  route*  In 
particular,  we  obtain  results  on  optimal  selection  when  the  sample 
size  is  random.  These  results  as  well  as  comments  on  a  related 
problem  are  collected  in  Section  6. 

In  the  final  section  we  are  fortunate  to  be  able  to  include  a  result 
due  to  Burgess  Davis  on  the  sequential  selection  of  a  decreasing  subsequence 
from  a  random  permutation.  This  result  teams  up  with  the  main  theorem  of 
this  paper  to  settle  a  second  conjecture  given  in  the  computational  paper 
of  Baer  and  Brock  (1968).  We  would  like  to  thank  Professor  Davis  for  his 
kind  suggestion  that  his  result  be  included  in  the  present  paper. 

2.  An  Algorithm  for  Computing  the  Optimal  Expected  Length. 

First  of  all,  as  we  remarked  in  the  Introduction,  we  may  assume 
without  loss  of  generality  that  the  common  distribution  of  the  observa¬ 
tions  X1,X2,...  is  uniform  on  (0,1). 

Let  us  define,  for  each  t  e  (0,1), 

(2.1)  =  (t- (t^Tg,...)  ejf  :  X  <  t  ,  i  -  1,2,...), 

the  class  of  policies  which  only  select  observations  smaller  than  t. 

We  jalso  let 

(2.2)  u  (t)  =  SUP  .  E(max{k:  t  <  n))  . 

n  tej  k  - 

t 

Clearly 


(2.3) 


u  -  u  (1) 
n  n 
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(2.4) 


(t )  *  P(X^  <_  t)  *  t. 

We  also  record  the  trivial  fact:  uq(0  *  0* 

Now  fix  t  and  consider  n+1  available  observations.  Because  of 
the  stationarity  of  the  X^s,  the  maximal  conditional  expected  sub¬ 
sequence  length,  given  X^,  will  be  just  uR(t)  if  X^  is  not  selected 
and  1+u^X^  if  X^  is  selected  (in  which  case  necessarily  X.^  <  t). 
Since  the  optimal  policy  must  do  whichever  maximizes  the  conditional 
expectation,  we  have  the  algorithm: 


(2.5) 


un+i  (t)  =  un(t)P(X1>  t)  +  Emax(un(t),l+un(X1))l(X^< 


max(un(t),l+un(s  )}ds  . 


It  follows  easily  that  for  each  n  and  t,  the  policy  which  achieves 
un(t)  is 


(2.6) 


i 

! 


•tL  =  min{i:  <  t  and  l+un_^  (X^ )  >  uQ-i(t)) 

>  n(arbitrary)  if  no  such  i  <  n 

Vi  =  “in{i  >  V  Xi  <  XTk  6X1(1  1+un-i(Xi}  -  Vi(XTk)] 
>  n (arbitrary)  if  no  such  i  <  n  or  if  >  n . 


Since  (2.4)  and  (2.5)  imply  that  each  uR(* )  is  strictly  increasing  on 
(0,1),  we  can  implicitly  define  the  functions  tR(n)  by  the  following  relations 
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(2.7) 


u  (t*(t))  ■  U  (t)-l  if  u  (t)  1 

n  n  n  n 

t*(t)  *  0  if  un(t)  i  ^  • 

Equation  (2.6)  now  simplifies  to  the  following: 

(2.8a)  =  min(i:  t*_^(t)  <  <  t) 

(2.8b)  Tk+1  =  min{i  >  xk:  t*_±(X  )  <  X.  <  X  )  . 

One  would  naturally  like  to  show  ur  ~  -J2ri  directly  from  (2.  5), 
but  this  does  not  seem  possible.  Hie  integral  equation  (2.5)  becomes 
effective  only  after  substantial  quantitative  information  about  u^(t ) 
is  obtained. 

Here  we  should  remark  that 

(2.9)  u  (t)  =  fk*1  |  (j)tJ(l-t)n_’j 

k=l  j=k  J 

for  all  t  small  enough  so  that  the  right  side  of  (2.9)  is  less  than  or 
equal  to  one.  This  can  be  shown  either  directly  from  the  integral  equation 
or  by  noting  that  when  uR(t)  <  1  the  optimal  policy  is  to  select  all 
successive  minima  among  those  X^'s,  i  =  1,2, ...,n,  which  are  smaller 
than  t.  The  right  side  of  (2.9)  is  the  expected  number  of  such  minima. 

3.  Existence  of  the  limit. 

To  show  that  un//n  has  a  limit,  we  shall  first  prove  that  a  limit 
exists  for  an  analagous  planar  Poisson  process  problem  and  then  show  that 
the  two  problems  are  asymptotically  similar.  The  proof  is  a  version  of 
Ham■llerale!y,  s  subadditivity  idea  made  somewhat  simpler  because  we  deal 
only  with  expectations  rather  than  with  random  variables. 
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3  a.  1116  Planar  Poisson  Process  Problem. 

Let  Z^,Zg,i  •  •  • ,  be  I  ID,  each  exponentially  distributed  with  mean 
one,  and  independent  of  the  X^'s  which  are  IID  uniform  on  (0,1). 

Let  be  the  class  of  policies  r  =  (f^Tg,. . .  )  with 

(a)  each  adapted  to  ( ( Z^^ , ) :  1  <  i  <  oo) 

(b )  1  <  t.  <  t0  <  •  •  •  and  X  >  X  >  •  •  •  : 

-  1  2  T1  t2 

and  let 

Tk 

w(\)  =  sup  t  E(max  k:  5*  Z.  <  \  )  . 

TeJZ  i=l  1 

In  other  words,  we  observe  a  Poisson  process  with  arrival  rate  one, 
on  an  interval  of  length  At  each  arrival  time  we  are  allowed  to 

observe  a  random  variable  uniform  on  (0,1)  and  independent  of  its 
predecessors,  and  the  object  is  to  select  a  decreasing  subsequence  of 
maximal  expected  length. 

What  makes  this  problem  so  appealing  is  the  well-known  fact  that, 

if  we  choose  p  and  t,  each  in  (0,1),  then  the  following  two  processes 
are  also  Poisson: 

(a)  Those  arrival  times  in  (0,p\)  for  which  the  corresponding 
X^' s  are  >  t, 

(b)  Those  arrival  times  in  (p\,\)  for  which  the  corresponding 
X^'s  are  <  t. 
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Those  processes  have  expected  numbers  of  arrivals  p(l-t)\  and  (l-p)tk 
respectively.  It  follows,  by  considering  the  subclass  of  J  consisting 

Lt 

of  those  t's  with 


that 


X_  >  t  if  l  z.  <  pk 
Ti  j=l  3 


<  t  if  £  z,  >  P\  , 

d=i  3 


(3*1)  v(\)  >  w(p(l-t)\)  +  w((l-p)t\)  . 

If  we  now  define 

p(x)  =  w(x2) 

p 

and  choose  \  =  (r+s)  t  p  =  l-t  =  r/(r+s),  (3.1)  becomes 
(3.2)  p(r+ s)  >  p(r)  +  p(s)  . 

E|y  the  elementary  lemma  on  subadditive  sequences  (Fekete  (1923),  or 
Khinehin  (1957 ) )>  this  implies  the  existence  of 


(3*3)  7  =  lim  p(t)/t  =  lim  sup  p(t)/t  >  1  . 

t  oo 


The  finiteness  of  7  follows  from  the  inequality 


w(\ )  <  £  jr  EL  <  M  l  g-  v'K  <  M  } 

k=0  *  k=0 
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the  second  inequality  holds  for  some  constant  M  by  (1.2).  The 


last  inequality  is  just  an  application  of  the  familiar  inequality 

e|x|  <  (ex2)1/2  . 

Thus  we  have  shown  that  there  is  a  finite  y  >  0  such  that 


O.M  7  =  lim  w(t2)/t 

t  -•  00 

=  lim  . 

t  OO 

3b.  Asymptotic  Similarity  of  the  Two  Processes. 

We  will  now  use  some  elementary  estimates  to  show  that-  un  ~ 
With  the  usual  notation 


P(k}\)  =  |  vVVj! 

j=0 

and 

P(kj\ )  =  l-P(k-lj\  ) 


we  will  prove  the  inequalities: 


(5*5)  P([  (i-eXJjX)  <  w(\) 

and 

(3.6)  w(\)  <u[(l+e)x>]  P(((l+e)\];\)  +  \P([  (l+€)\;\)  . 

Utie  first  inequality  holds  since  the  optimal  policy  for  n  = 
observations  used  on  the  Poisson  process  (paying  no  heed  to  the  Z. 


w(n). 


[(l-€)\] 

’s) 
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would  yield  an  expected  length  of  at  least 
whenever  there  are  at  least  [(l-e)*.]  arrivals. 

For  the  second  inequality  let  N  be  the  number  of  arrivals  and 
let  L  be  the  subsequence  length  obtained  when  the  optimal  policy 
in  J  is  used.  We  then  have 

Li 


v(\)  =E(L|N<  [  (l+€)\]  )P([(1+€)\]j\) 

*e(e(l|h)i(k>[(1^)m)>  . 


We  note  E(l|n  <  [  (l+e)\]  )  <  ^+€)\]  E(L|n)  <  N. 

Inequality  (3.6)  then  follows  since 


From  the  fact  (3.M  that  w(\)  and  elementary  bounds  on  the 

Fbisson  distribution  one  now  easily  deduces  from  (3.5)  and  (3*6)  that 

un  ~  * 

We  remark  that  it  is  not  hard  to  extend  this  result  to  show  that  for  each 
t€(0,l]  one  has  un(t)//nt  ■*  7. 


10 


!*•.  A  Lower  Bound  for  the  Limit. 

For  any  constant  at  >  0,  and  for  each  n,  we  consider  the  policy 
x(n)  =  x(nja),  where 

^(n)  =  min{j:  X^  >  l-a//n) 

^(n)  =  min{j>Tk(n):  Xj  e  [X  (n)^n,  X  (n)))  . 

K  K 

We  shall  show  that  for  these  policies 


(t.D 


lim  inf  E  max{k:  x  (n)  <  n}  >  /n  min  (a, 2/a) 


n  oo 


The  right  side  of  (4,1)  is  maximized  for  a  =  which  shows  that 

lim  infn  ^  ^  u^/Vn  >  t/2  *  (Of  course  when  we  complete  the  proof  of 
the  aeorem,  (4.1)  with  a  =  ^2  will  imply  that  the  policies  x(n;  y^?) 
are  asymptotically  optimal,  ) 

What  makes  the  policies  x(n)  easy  to  evaluate  is  the  fact  that  if 

xT(n)>a/^n,  then  Vl(n)’Vn)  XT(n)~XT  (n)  are  conditionally 

k '  k  k+1 

independent  and  independent  of  (T^(n),...,T^(n);  Xt  (n)>’'*>xT  (n)^  * 

_  lk 

with  geometric  (p  =  ct/yiT)  and  uniform  on  (0 distributions 

respectively.  Hence  we  now  let  {Yk:  k  =  1,2,...}  and 

{Z^k  =  1,2,...)  be  independent  sequences  of  IID  random  variables  with 

these  geometric  and  uniform  distributions  respectively.  Also  set  Sk  =  Y^  + 

and  +  •  •  •  +  Zj^,  and  define 


' +  Y, 


Mr  =  max{k:  Sk  <  n  and  <  l-a//n}  . 
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r 


We  first  observe 


( k.2 )  EMn  <  E  maxlk:  Tk(n)  <  n)  • 

Now,  for  any  c  >  0,  Chebyshev’ s  inequality  gives 


/ 


1  +0(n-1^2) 

if 

k  <  (l-e)an1//2 

r(sk  <  »>  -  < 

o 

1 

H 

ro 

if 

k  >  (l+€)«n1//2 

- N 

H 

£ 

tT 

i 

H 

ro 

if 

k  <  (l-e)(2/a)n1//2 

P(S^  <  1-aVn)  -  < 

i 

j  0(n_l/2) 

if 

k  >  (l+e)(2/a)nly/2 

The  0(n”^2)  terms  are  uniformly  small  in  the  indicated  range,  so, 
by  the  independence  of  the  two  sequences, 

n  ,  /2 

EM  =  T  P(s  <  n)P(S*  <  l-a//n)  ~  n  '  min(a,2/a)  . 
n  k=l  K  K  “ 

Ey  (4.2),  this  shows  that  (l+.l)  holds,  completing  the  proof. 

5.  An  Upper  Bound  for  the  limit. 

Now  that  we  know  that  lim  un//n  exists  and  is  at  least  /2  ,  to 
complete  the  proof  of  the  Theorem  it  will  suffice  to  show  that 

(5.1)  lim  inf  vyVn  <^2  . 
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Our  proof  of  (5.1)  hinges  on  showing  that 


(5-2)  Un(t)/Vt  t  in  t  for  each  n  . 

Hie  deri/ative  of  u^tVyT  is 

fl/2{u;(t)-(2t)-1un(t)}  , 

so,  to  prove  (5*2),  we  must  show  that  for  each  te(0,l) 

(5-3)  un(t-»6) -un(t)  >  (|-)(6/t)un(t)  +  o(6)  as  6  I  0  . 

This  inequality  will  be  proved  by  selecting  a  suboptimal  member  of 
^as  defined  in  (2.1))  and  showing  that  this  policy  improves  on  the 
optimal  policy  in  ^  by  an  amount  equal  to  the  rigit  side  of  (5.3). 

What  we  actually  do  is  a  bit  more  complicated  than  this  and  involves 
showing  that  un(t)  is  also  the  maximal  expected  subsequence  length  in 
a  problem  where  the  number  of  available  observations  is  random  with  a 
binomial  distribution. 

5a.  Optimal  Selection  with  Binr>mift.ij.y  many  Observations. 

Since  the  policy  in  J  which  achieves  u  (t),  as  given  by 

t  n 

(2.8),  ignores  the  actual  values  of  all  X^'s  which  are  greater  than 
t,  and  since  the  other  X^'s  are,  conditionally,  IID  uniform  on  (0,t), 
we  could  just  as  well  replace  the  X^'s  by  a  sequence  of  observable  coin 
tosses  with  probability  t  of  heads,  letting  each  toss  which  gives  heads 
be  accompanied  by  the  next  in  a  sequence  of  IID  random  variables  uniform 
on  (0,t). 
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To  exploit  this  observation  let  and  X^,X^,. . .  be 

independent  sequence  of  IID  random  variables,  the  Y's  Bernoulli  (t), 
and  the  X's  uniform  on  (0,1 ).  Let  \ be  the  class  of  policies 
adapted  to  the  (Y^,X^)'s  which  select  a  monotone  decreasing  subse¬ 
quence  by  selecting  only  X^'s  for  vhich  Y.^  =  1  and  which  totally 
ignore  all  X^'s  for  which  Y^  =  0.  Then  for  the  un(t)  defined  by 
(2.2)  we  have  the  representation 


sup 


T  € 


“t 


E(max(k:  <  n))  =  u^t)  . 


Since  we  have  made  the  X^'s  uniform  on  (0,1)  rather  than 
on  (0,t)  —  this  is  to  avoid  confusion  in  what  follows  —  the  policy 
x  =  (t^,-^,...)  which  achieves  u^Ct)  becomes 


=  min{i:Y^=l  and  t*_^(t)  £  tX^<  t) 


tk+1  =  min{i  >  Tk:Yi=l  and  t*_i(tXT  )  <  tX1<tXT  ). 

k  k 

Now  suppose  we  introduce  a  second  coin  toss  at  each  stage  —  letting 
Y^,Yg, . . .  be  IID  Bernoulli  (p' )  and  independent  of  the  Y^'s  and  the 
X± ' s  —  and  we  allow  policies  adapted  to  the  (Y£ }  as  well,  but  maintain 
the  requirement  that  all  X^' s  for  which  Y^  =  0  must  be  ignored.  Then 
clearly  what  we  have  introduced  is  external  randomization;  those  policies 
which  depend  in  some  way  on  the  (Y£)  are  simply  randomized  policies, 
and,  of  course,  none  of  these  can  improve  on  the  best  non -randomized 
policies.  In  particular,  any  policy  which  ignores  all  X^'s  for  which 
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either  =  0  or  Y|  =  0  is  really  a  policy  in  j  hence  the 

expected  length  of  the  subsequence  of  X1,...,XQ  which  it  selects  is 
no  greater  than  un(tt’ ).  Just  such  a  policy  will  be  needed  in  proving 
(5-3). 

5b.  Monotonicity  of  un(t)//t  . 

We  now  fix  n  and  let  the  (Y^l  be  Bernoulli  (t+6 )  and- the 
(Y|)  be  Bernoulli  (t/(t+&)).  First  note  that  P(Yi  =  Y^  =  l)  =  t.  We 
consider  two  randomized  policies  t  and  t'  in  li.  Hie  first  is 
to  be  equivalent  to  the  optimal  policy  (for  given  n)  in  ,  while 
the  second  is  to  be  a  slight  modification  of  the  first.  Specifically, 
we  let  t  =  with 

I 

Tj,  =  min{i:Yi=Y^=l  and  t^Ct)  <  tX^ 


L 


Tk+1  =  min{i  >  Tk:Yi=Yi=1  and  t*_i(tX<r  )  <  tX±  <  tXT  } 

Tk  k 


We  want  x*  to  agree  with  t  up  to  the  first  i  <  n,  if  any,  at  which 
Y^  =  1,  Y^  =  0,  and  at  which  X^  would  have  been  selected  by  t  if 

Y|  had  been  1.  We  want  t*  to  select  this  Xi,  but  thereafter  to 
continue  to  behave  like  t.  We  thus  define 

I  =  minti:  Y.  = 1,  Y'  = 0,  t*  , (tX  )  <  tX,  <  tX  ] 
i  i  ’  n-i  a±  ~  1 


=  oo  if  no  such  i  <  n  , 


where 
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MMANUMiliKitiM 


0  (and  Xq  S  1)  if  >  i. 


We  then  let  t*  =  where 


Tk  "  Tk  if  Tk  <  1 


t1  -  T  if  T  <  T  <  T 

k  A  k  k+1 


Tk+i  3  10111(1  >  Tk:  Yi =  Yi  ■  i  “d  («T,)<txi<txTl)  if  t^>  I 

k  k 


Now,  for  convenience,  we  let  L  and  L'  be  the  lengths  of  the 
subsequences  of  X^,...,Xn  selected  by  t  and  t'  respectively. 
Then  L  =  L*  on  (I  >  n). 


EL  =  uQ(t)  , 


EL'  <  un(t+$) 


(5.**)  un(t-t6) -un(t)  >E(L'-L|l<n)  P(l<n) 


Furthermore,  from  (2.2)  and  the  definitions  of  t  and  t'. 


(5-5) 


E(L'-L|l=i,  X  =  x)  = 


l+Elu^  (tXA ) |  I=d,  X^*x )  -  uQ_i  (tx ) 


(Note:  (5*5)  is  valid  even  when  i=n,  since  we  have  set  Ug(t)  a  0. ) 

From  (5.*0  and  (5.5)  we  see  that  to  establish  (5*3)  it  suffices 
to  prove 

(5-6)  P(I  <  n)  <  (6/t)un(t)  +  o(6)  as  6  Jo 

and,  for  each  i  <  n, 

(5-7)  E(un_1(tXi)|l=i,  X^^)  >  un>>i(tx)-l/2  . 


To  prove  (5*6)  we  first  remark  that,  since  ♦ 1  as 

6  0,  we  have 

n 

(5.8)  P(I  <  n)  =  £  P(Y.=1,  Y!=0,  AJ  +  o(8) 

i=l 

where  A,  denotes  the  event  It*.  (tX  )  <  tX.  <  tX  } .  A.  is 
1  n-i  cr^  —  i  cr^  1 

independent  of  (Y^=l,  Y^=0)  so 


P(Y±=1,  Y£=0,  A±)  =  P(Yi=l,  Y^=0)  P(Ai) 
=  (6/t)  P(Yi=Y^=l,  Ai) 

=  (8/t)  P(XA  is  selected  by  t)  . 


Bitting  this  back  into  (5-  8)  we  have 


P(I  <  n)  =  (6/t)  £  PCXj  is  selected  by  x)  +  o(b) 


=  (6/t)  un(t)  +  o(6)  , 


\*ich  is  (5.6 ). 

To  prove  (5*7);  we  first  remark  that  the  conditioned  distribution 
of  tX^  given  X  =  i  and  =  x  is  uniform  on  (t*_^  (tx),tx) .  Also  we 

note  that 


(5.9) 


un-i(tn-i(tx))^  Vi^'1 


with  equality  holding  unless  uft  ^(tx)<l.  Hence  if  ^(*  )  were  linear  on 
the  interval  (t*_^  (tx),tx),  (5*7)  would  hold  and  would  in  fact  be  an  equal! 
if  un_j.(tx)>l.  So  the  most  natural  way  to  establish  (5-7 )  is  to  prove  the 
following  lemma: 

Lemma  5»ls  For  each  n,  un(* )  is  concave. 

Proof.  We  proceed  by  induction  and  .first  note  the  lemma  is  true  for 
n  =  1  because  u^(t)  =  t. 

Using  (2.7),  we  re-write  (2.5)  as 


un+l^  =  t+(l-t)un(t)  +  tn(t)(un(t)-l) 


Cm  "" 


(s)ds  . 


Formally  we  have 


u;+1(t)  =  l-un(t)+  (l-tju^tj  +  t*’  (t)(un(t)-l) 


+  t*(t)u.(t)  +  un(t)  -  t*'  (t)un(tj(t)) 


Now  u  (t*(t))  =  (u  (t)-l)+  and  t*'  (t)  =  0  if  u  (t)  <  1.  Hence 
nv  n  '  '  nv  n  n 

whether  or  not  un(t)  >  1  we  have 


un+l(t)  =  1  +  tl-(t-t*(t)))u^(t) 

and 


un+l(t)  =  . 

If  u^C* )  is  concave  (i.e. ,  <  0),  then  (t-t*(t))  is  increasing. 

Since,  t-t*(t)  <  1  and  u^(*  )  >  0,  we  conclude  that  u"  ^(t)  <  0,  i-e., 

u  ,.  (• )  is  concave. 
r.  +1 

Ihe  validity  of  the  foregoing  differentiations  are  also  easily 
established  by  induction.  To  begin  note  u^(t)  =  t  and  t*(t)  a  0. 

Next  the  differentiability  of  uR  and  implies  t  (• )  is 

differentiable  on  { t:  un(t)>l);  in  fact,  we  have 
t*’  (t)  =  u^(t)/u^(t*(t)).  By  means  of  (2.5)  one  even  more  easily 
sees  the  required  differentiability  of  un(*)* 

This  completes  the  proof  of  the  lemma,  from  which  we  obtain 
(5-7). 


5c.  Completion  of  the  Proof. 

At  last  we  are  ready  to  prove  (5-1).  We  define 

(5.10)  cn  =  un//n  =  un(l)//ii  , 

and  note  that  it  only  remains  to  sho*  lim  inf  cR  <J2.  By  (5.2), 
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(5.11) 


cn(nt)1//2  >  un(t)  t€(0,l)  • 


Abbreviate 


-  t;<i> 


and  define  sr  analogously  by 


so 


(5.12) 


c  (ns*)^2  =  c  /n  - 1 
n'  n'  n» 


s*  =  1  -  2C:Vl/2  +  c_2n*’1 


n 


Now  (5.10)  and  (5.11)  imply  that 


(5-15) 


*  * 
s  <  t 
n  —  n 


so,  re-writing  (2.5),  with  t  =  1,  as 


un+l  =  un  +  /.,  (un(t)-(Vl)Mt  > 
n 

we  conclude  from  (5.10),  (5.11),  and  (5.13)  that 

(5.14)  un+1  <  cftnl/2  +  f  (cn(nt)1/2-  (cnnl/2-l))dt 

~  8 


This  is  perhaps  the  central  inequality  in  the  proof,  and  it  is 
made  possible  by  (5*2).  The  remainder  of  the  proof  demands  only 
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straightforward  analytical  manipulation  of  the  right  side 
of  (5.1*0. 

Evaluating  the  right  side  of  (5.1**-)  we  get 
(5-15)  cn+l  -  cnnl/2  +  (2/5)enal/2(l-«^/2)-(l-*J)(cnnl/2-l)  . 

Substituting  (5.12)  and  the  Taylor  series  expansion 

H?*  -  (5/2 )(l -s*)-(3/8)(l-s*)2  +  O^l-s*)3)  , 
into  the  right  side  of  (5.15)  gives 

(5.16)  un+1  <  cnn1^2  +  C;V1/2  +  0  (n-1 ) 

=  (n+1  )1//2(  cnnl/2  (n+1 ) ~l/2  +  c  "1n'1'/2  (n+1 )  (n‘3/2 ) }  . 

Now  direct  computation  shows  that 

n1//2(n+l)‘1//2  =  1-  (1/2  -  &n  )/n1//2  (n+1  )1//2 
\diere  &n  >  0  and  6n  ■*  0  as  n  -*  00.  Hence  (5.16)  is 

un+l  $  (n+1  )l/2(cn+n‘1/2(n+l )“l/2  (c^-(|)cn  +  o(l))}  . 

From  (5.10),  with  n+1  instead  of  n,  we  have 

(5.17)  cn+1  <  cn  +  n-1/2(n+l)*l/2(c;1-  (|)cn  +  o(l))  . 


I 
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This  is  exactly  what  we  need  to  show  that  I  in  inf  cn  <  /2  and 
thereby  complete  the  proof. 

-1 

We  use  the  fact  that  c  -c/2  is  decreasing  in  c  and  zero  for 
c  =/2.  Choose  e  >  0  and  n£  large  enough  so  that  for  all  n  >  n£ 
the  o(l)  in  (5.17)  satisfies 

o(l)  <  (|)(/2  +  e)|  s6£. 

Then  for  all  n  >  n£, 

cn  >  +  c  cn+1  <  cQ  -  6  n "1/,2  (n+1  f1^2  . 


But 


so  c  >  J2  +  €  for  some  nu  >  n,  implies  c  __  <  +  c  for  some 

n0  0  -  €  nQ-hn 

Since  €  >  0  is  arbitrary  this  shows  that  lim  inf  cR  <  /2  as 
required  to  complete  the  proof. 

We  should  remark  that  exactly  the  same  argument  can  be  used  to 
prove  that 


u^tJAnt)1/2  -V2 


for  every  t  in  (0,1]. 


6.  Random  Sample  Size  and  an  Open  Problem, 

As  an  easy  consequence  of  ~  -/2n  one  can  obtain  several  results 
on  subsequence  selection  when  the  underlying  sample  size  N  is  random. 

In  particular  we  now  define  u^  by 

(6.1)  u^  =  sujD  E(max(k:  <  N) )  , 

where  jS  consists  of  those  strategies  adapted  to 
adapted  to  N.  When  N  is  Poisson  or  binomial  (with  fixed 
can  easily  show  that  as  EN  oo  we  have 

(6.2)  Ujj  . 

In  fact  one  can  check  that  the  same  result  holds  whenever  EN  oo  and 

Var  N  =  O(EN).  (To  compare  these  results  with  the  asymptotic  relations 

of  Sections  3a  and  5a  one  needs  to  note  that  the  class  of  policies  applied 

there  were  quite  different  from  those  used  in  (6.1)  since  they  were  also 

adapted  to  the  relevant  Poisson  or  Binomial  processes. ) 

We  now  consider  the  next  most  complex  case  where  N  has  the 

k-1 

geometrical  distribution  (P(N=k)  =  p(l-p)  ,  k=l>2>...).  The  condition 
Var  N  =  O(EN)  does  not  hold,  so  as  before  the  natural  analysis  begins 
with  dynamic  programming. 

Analogously  to  (2.2)  we  define 

%(t)  =  sup  E(max{k:  t.  <  N) )  . 


but  not 
p)  one 
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One  can  easily  check  in  this  case  that 


u»(p)(t)  =  uh(p')(1) 

where 

p’  =  p/{p+t(i-p)i  . 

If  we  define  f(p)  =  Ujj^(l)  we  are  led  to  a  single  integral  equation: 
f(p)  =  (1-p)  f  max{f(p),  l+f(l/{p+t(l-p)}  )}dt  . 

Jo 

As  we  similarly  noted  at  the  end  of  Section  2  we  can  solve  the  equation 
for  sufficiently  extreme  values.  In  this  case  we  know 

f(p)  =  log (p”1 )  if  p  >  e”1  . 

This  observation  just  says  that  if  f(p)  <1  the  optimal  policy  is  to 
select  all  record  values  exactly  as  one  does  in  the  fixed  sample  size 
problem  when  un(t)  <1. 

One  would  like  to  determine  the  asymptotic  behavior  of  f(p)  as 
P  0,  We  conjecture,  but  have  not  been  able  to  prove,  that  as  p  -»  0 

f(p)  ~  cp‘l/2 

for  a  constant 

c  <V2  . 
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7. 


A  Result  of  Burgess  Davis  on  Selections  from  Permutations. 

If  one  considers  a  random  permutation  of  the  set  {lf2,...,n}, 
then  the  distribution  of  the  length  of  the  longest  decreasing  subsequence 
is  the  same  as  that  in  a  random  sample  of  size  n  from  a  uniform  distri¬ 
bution.  In  contrast,  the  length  of  the  optimal  sequentially  selected 
decreasing  subsequence  is  stochastically  larger  in  the  first  case. 

We  let  £^  denote  the  expected  length  of  the  longest  decreasing 
subsequence  which  can  be  chosen  sequentially  from  a  random  permutation. 

The  main  result  of  this  paper  immediately  implies  that 

(7.1)  lim  inf  l  l-fn  >  -Jl  . 

n 

Part  of  the  interest  of  this  observation  stems  from  the  fact  that 

the  study  of  the  £^'s  was  already  a  primary  objective  in  Baer  and 

Brock  (1968),  where  "natural"  is  used  as  a  synonym  for  "sequential."  On 

the  basis  of  substantial  computation  Baer  and  Brock  even  conjectured  that 

£  %  /2n  .  The  truth  of  this  conjecture  is  an  immediate  consequence  of  our 

n 

main  result  u^  ^  /2n  and  the  previously  unpublished  theorem  due  to  Burgess 

Davis  which  is  proved  below. 

Theorem  (Burgess  Davis). 

£  ^  u  . 

n  n 

Proof.  First  suppose  X^,  1  £  i  £  00  ,  are  i.i.d.  uniform  on  [0,1]. 

For  any  t  >  0  and  0  <  6  <  1,  let  l£n^  denote  the  interval 
((k-l)eVn,  k£/Vn]  and  let  denote  the  cardinality  of  the  set 

(i:  1  <_  i  £  n,  Xi  e  l£n^}.  Consider  the  events 

A  -  {  min  Y^  >  (1-5)  zjn] 
n  l<k<A/c  k  “ 

and  note  that  elementary  binomial  estimates  show  P(An)  -*■  1  as  n  -*■  00  . 
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Now  assign  “pretend  ranks“  as  follows i 

If  e  l£n)  and  X±  is  one  of  the  first  [(1-6)  e/n]  elements 
of  we  chose  its  "pretend  rank"  at  random  from  those  integers 

between  (k-1) (l-6)e/n  and  k(l-6)e/n  which  are  not  already  assigned. 
We  then  ignore  all  X^'s  in  after  the  first  [(l-6)e/n]. 

We  note  that  on  the  sequence  of  “pretend  ranks'*  is  simply  a 

permutation  on  the  integers  (k:  1  <_  k  (l-<5)n). 

Now  use  the  optimal  random  permutation  policy  on  the  set  of 
"pretend  ranks"  to  select  a  subsequence  with  decreasing  pretend  ranks. 
Delete  from  the  subsequence  all  X^f s  which  are  not  smaller  than  all 
their  predecessors  in  the  subsequence.  This  gives  an  decreasing  sub¬ 
sequence* 

We  now  claim  that  the  expected  cardinality  of  the  resulting 

subsequence  is  at  least  P(A  )(Jl  -e/n)  with  m  *  [(l-6)n].  First 

n  m 

note*  on  An  the  expected  length  before  deletions  is  Let 

denote  the  interval  which  contains  the  smallest  selected  element 

k  - 

from  {X^,X2» . . . .  For  xi+^  to  an  observation  deleted  from  the 
selected  subsequence  it  is  necessary  that  X^+jej£n\  By  Boole's 
inequality  and  the  Independence  of  X^+^  and  we  have  that  the 

expected  number  of  deleted  observations  is  at  most 


This  proves  l  // n  <  P(A  )  1  u  //n  +  e  .  By  (7.1)  and  the  arbitrariness 
m  —  n  n 

of  £  and  6  the  theorem  is  proved. 
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